Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 17689 |
| Missing cells | 5287 |
| Missing cells (%) | 1.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.8 MiB |
| Average record size in memory | 168.0 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 7 |
| Boolean | 2 |
entry_age is highly overall correlated with years_between_high_school_and_college | High correlation |
from_public_school is highly overall correlated with quota | High correlation |
num_approved_credits is highly overall correlated with num_failed_credits and 3 other fields | High correlation |
num_credits is highly overall correlated with num_credits_reference_semester and 1 other fields | High correlation |
num_credits_reference_semester is highly overall correlated with num_credits | High correlation |
num_disciplines is highly overall correlated with num_credits | High correlation |
num_failed_credits is highly overall correlated with num_approved_credits and 1 other fields | High correlation |
num_nonattendance_credits is highly overall correlated with num_approved_credits and 1 other fields | High correlation |
quota is highly overall correlated with from_public_school | High correlation |
semester_average is highly overall correlated with num_approved_credits and 3 other fields | High correlation |
status is highly overall correlated with num_approved_credits and 1 other fields | High correlation |
years_between_high_school_and_college is highly overall correlated with entry_age | High correlation |
race is highly imbalanced (52.6%) | Imbalance |
marital_status is highly imbalanced (66.9%) | Imbalance |
has_college_degree is highly imbalanced (91.5%) | Imbalance |
race has 2521 (14.3%) missing values | Missing |
quota has 1783 (10.1%) missing values | Missing |
from_public_school has 478 (2.7%) missing values | Missing |
years_between_high_school_and_college has 462 (2.6%) missing values | Missing |
years_between_high_school_and_college has 514 (2.9%) zeros | Zeros |
semester_average has 1220 (6.9%) zeros | Zeros |
num_approved_credits has 3165 (17.9%) zeros | Zeros |
num_dispensed_credits has 16089 (91.0%) zeros | Zeros |
num_failed_credits has 10167 (57.5%) zeros | Zeros |
num_nonattendance_credits has 13352 (75.5%) zeros | Zeros |
num_locked_credits has 16767 (94.8%) zeros | Zeros |
num_exams has 8423 (47.6%) zeros | Zeros |
Reproduction
| Analysis started | 2024-06-03 18:28:05.131967 |
|---|---|
| Analysis finished | 2024-06-03 18:28:58.956030 |
| Duration | 53.82 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
entry_age
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 58 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 9 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.893552 |
| Minimum | 15 |
|---|---|
| Maximum | 86 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 17 |
| Q1 | 18 |
| median | 21 |
| Q3 | 28 |
| 95-th percentile | 47 |
| Maximum | 86 |
| Range | 71 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 9.7287659 |
|---|---|
| Coefficient of variation (CV) | 0.39081469 |
| Kurtosis | 2.3766237 |
| Mean | 24.893552 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.68629 |
| Sum | 440118 |
| Variance | 94.648887 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 2649 | |
| 19 | 2216 | |
| 17 | 2178 | |
| 20 | 1518 | 8.6% |
| 21 | 1058 | 6.0% |
| 22 | 853 | 4.8% |
| 23 | 636 | 3.6% |
| 24 | 545 | 3.1% |
| 25 | 422 | 2.4% |
| 26 | 421 | 2.4% |
| Other values (48) | 5184 |
| Value | Count | Frequency (%) |
| 15 | 1 | < 0.1% |
| 16 | 173 | 1.0% |
| 17 | 2178 | |
| 18 | 2649 | |
| 19 | 2216 | |
| 20 | 1518 | |
| 21 | 1058 | 6.0% |
| 22 | 853 | 4.8% |
| 23 | 636 | 3.6% |
| 24 | 545 | 3.1% |
| Value | Count | Frequency (%) |
| 86 | 1 | < 0.1% |
| 75 | 1 | < 0.1% |
| 72 | 4 | < 0.1% |
| 69 | 1 | < 0.1% |
| 68 | 4 | < 0.1% |
| 67 | 9 | |
| 66 | 4 | < 0.1% |
| 65 | 4 | < 0.1% |
| 64 | 5 | < 0.1% |
| 63 | 15 |
gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.3 KiB |
| Female | |
|---|---|
| Male |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.1464752 |
| Min length | 4 |
Characters and Unicode
| Total characters | 91036 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Male |
|---|---|
| 2nd row | Male |
| 3rd row | Female |
| 4th row | Male |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Female | 10140 | |
| Male | 7549 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| female | 10140 | |
| male | 7549 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 27829 | |
| a | 17689 | |
| l | 17689 | |
| F | 10140 | 11.1% |
| m | 10140 | 11.1% |
| M | 7549 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 73347 | |
| Uppercase Letter | 17689 | 19.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 27829 | |
| a | 17689 | |
| l | 17689 | |
| m | 10140 | 13.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 10140 | |
| M | 7549 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 91036 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 27829 | |
| a | 17689 | |
| l | 17689 | |
| F | 10140 | 11.1% |
| m | 10140 | 11.1% |
| M | 7549 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 91036 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 27829 | |
| a | 17689 | |
| l | 17689 | |
| F | 10140 | 11.1% |
| m | 10140 | 11.1% |
| M | 7549 | 8.3% |
race
Categorical
IMBALANCE  MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2521 |
| Missing (%) | 14.3% |
| Memory size | 138.3 KiB |
| White | |
|---|---|
| Mixed_race | |
| Black | |
| Prefer_not_to_declare | 546 |
| Yellow | 66 |
Length
| Max length | 21 |
|---|---|
| Median length | 5 |
| Mean length | 6.1937632 |
| Min length | 5 |
Characters and Unicode
| Total characters | 93947 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White |
|---|---|
| 2nd row | White |
| 3rd row | White |
| 4th row | White |
| 5th row | White |
Common Values
| Value | Count | Frequency (%) |
| White | 11287 | |
| Mixed_race | 1837 | 10.4% |
| Black | 1408 | 8.0% |
| Prefer_not_to_declare | 546 | 3.1% |
| Yellow | 66 | 0.4% |
| Indigenous | 24 | 0.1% |
| (Missing) | 2521 | 14.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| white | 11287 | |
| mixed_race | 1837 | 12.1% |
| black | 1408 | 9.3% |
| prefer_not_to_declare | 546 | 3.6% |
| yellow | 66 | 0.4% |
| indigenous | 24 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 17235 | |
| i | 13148 | |
| t | 12379 | |
| W | 11287 | |
| h | 11287 | |
| a | 3791 | 4.0% |
| c | 3791 | 4.0% |
| _ | 3475 | 3.7% |
| r | 3475 | 3.7% |
| d | 2407 | 2.6% |
| Other values (15) | 11672 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 75304 | |
| Uppercase Letter | 15168 | 16.1% |
| Connector Punctuation | 3475 | 3.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 17235 | |
| i | 13148 | |
| t | 12379 | |
| h | 11287 | |
| a | 3791 | 5.0% |
| c | 3791 | 5.0% |
| r | 3475 | 4.6% |
| d | 2407 | 3.2% |
| l | 2086 | 2.8% |
| x | 1837 | 2.4% |
| Other values (8) | 3868 | 5.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 11287 | |
| M | 1837 | 12.1% |
| B | 1408 | 9.3% |
| P | 546 | 3.6% |
| Y | 66 | 0.4% |
| I | 24 | 0.2% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3475 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 90472 | |
| Common | 3475 | 3.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 17235 | |
| i | 13148 | |
| t | 12379 | |
| W | 11287 | |
| h | 11287 | |
| a | 3791 | 4.2% |
| c | 3791 | 4.2% |
| r | 3475 | 3.8% |
| d | 2407 | 2.7% |
| l | 2086 | 2.3% |
| Other values (14) | 9586 |
Common
| Value | Count | Frequency (%) |
| _ | 3475 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 93947 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 17235 | |
| i | 13148 | |
| t | 12379 | |
| W | 11287 | |
| h | 11287 | |
| a | 3791 | 4.0% |
| c | 3791 | 4.0% |
| _ | 3475 | 3.7% |
| r | 3475 | 3.7% |
| d | 2407 | 2.6% |
| Other values (15) | 11672 |
marital_status
Categorical
IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 34 |
| Missing (%) | 0.2% |
| Memory size | 138.3 KiB |
| Single | |
|---|---|
| Married | |
| Divorced | 435 |
| Others | 369 |
| Widowed | 73 |
Length
| Max length | 17 |
|---|---|
| Median length | 6 |
| Mean length | 6.1794959 |
| Min length | 6 |
Characters and Unicode
| Total characters | 109099 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Single |
|---|---|
| 2nd row | Single |
| 3rd row | Single |
| 4th row | Single |
| 5th row | Single |
Common Values
| Value | Count | Frequency (%) |
| Single | 14822 | |
| Married | 1929 | 10.9% |
| Divorced | 435 | 2.5% |
| Others | 369 | 2.1% |
| Widowed | 73 | 0.4% |
| Legally_separated | 27 | 0.2% |
| (Missing) | 34 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| single | 14822 | |
| married | 1929 | 10.9% |
| divorced | 435 | 2.5% |
| others | 369 | 2.1% |
| widowed | 73 | 0.4% |
| legally_separated | 27 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 17709 | |
| i | 17259 | |
| l | 14876 | |
| g | 14849 | |
| S | 14822 | |
| n | 14822 | |
| r | 4689 | 4.3% |
| d | 2537 | 2.3% |
| a | 2010 | 1.8% |
| M | 1929 | 1.8% |
| Other values (14) | 3597 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 91417 | |
| Uppercase Letter | 17655 | 16.2% |
| Connector Punctuation | 27 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 17709 | |
| i | 17259 | |
| l | 14876 | |
| g | 14849 | |
| n | 14822 | |
| r | 4689 | 5.1% |
| d | 2537 | 2.8% |
| a | 2010 | 2.2% |
| o | 508 | 0.6% |
| v | 435 | 0.5% |
| Other values (7) | 1723 | 1.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 14822 | |
| M | 1929 | 10.9% |
| D | 435 | 2.5% |
| O | 369 | 2.1% |
| W | 73 | 0.4% |
| L | 27 | 0.2% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 27 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 109072 | |
| Common | 27 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 17709 | |
| i | 17259 | |
| l | 14876 | |
| g | 14849 | |
| S | 14822 | |
| n | 14822 | |
| r | 4689 | 4.3% |
| d | 2537 | 2.3% |
| a | 2010 | 1.8% |
| M | 1929 | 1.8% |
| Other values (13) | 3570 | 3.3% |
Common
| Value | Count | Frequency (%) |
| _ | 27 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 109099 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 17709 | |
| i | 17259 | |
| l | 14876 | |
| g | 14849 | |
| S | 14822 | |
| n | 14822 | |
| r | 4689 | 4.3% |
| d | 2537 | 2.3% |
| a | 2010 | 1.8% |
| M | 1929 | 1.8% |
| Other values (14) | 3597 | 3.3% |
quota
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1783 |
| Missing (%) | 10.1% |
| Memory size | 138.3 KiB |
| OC | |
|---|---|
| L05 | |
| L01 | |
| L06 | |
| L02 |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.4170124 |
| Min length | 2 |
Characters and Unicode
| Total characters | 38445 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OC |
|---|---|
| 2nd row | OC |
| 3rd row | OC |
| 4th row | OC |
| 5th row | OC |
Common Values
| Value | Count | Frequency (%) |
| OC | 9273 | |
| L05 | 2243 | 12.7% |
| L01 | 2172 | 12.3% |
| L06 | 1109 | 6.3% |
| L02 | 1109 | 6.3% |
| (Missing) | 1783 | 10.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| oc | 9273 | |
| l05 | 2243 | 14.1% |
| l01 | 2172 | 13.7% |
| l06 | 1109 | 7.0% |
| l02 | 1109 | 7.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 9273 | |
| C | 9273 | |
| L | 6633 | |
| 0 | 6633 | |
| 5 | 2243 | 5.8% |
| 1 | 2172 | 5.6% |
| 6 | 1109 | 2.9% |
| 2 | 1109 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 25179 | |
| Decimal Number | 13266 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6633 | |
| 5 | 2243 | 16.9% |
| 1 | 2172 | 16.4% |
| 6 | 1109 | 8.4% |
| 2 | 1109 | 8.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 9273 | |
| C | 9273 | |
| L | 6633 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25179 | |
| Common | 13266 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6633 | |
| 5 | 2243 | 16.9% |
| 1 | 2172 | 16.4% |
| 6 | 1109 | 8.4% |
| 2 | 1109 | 8.4% |
Latin
| Value | Count | Frequency (%) |
| O | 9273 | |
| C | 9273 | |
| L | 6633 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38445 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 9273 | |
| C | 9273 | |
| L | 6633 | |
| 0 | 6633 | |
| 5 | 2243 | 5.8% |
| 1 | 2172 | 5.6% |
| 6 | 1109 | 2.9% |
| 2 | 1109 | 2.9% |
from_public_school
Boolean
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 478 |
| Missing (%) | 2.7% |
| Memory size | 34.7 KiB |
| True | |
|---|---|
| False | |
| (Missing) | 478 |
| Value | Count | Frequency (%) |
| True | 12543 | |
| False | 4668 | 26.4% |
| (Missing) | 478 | 2.7% |
has_college_degree
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.4 KiB |
| False | |
|---|---|
| True | 188 |
| Value | Count | Frequency (%) |
| False | 17501 | |
| True | 188 | 1.1% |
shift
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.3 KiB |
| Morning_Afternoon | |
|---|---|
| Night | |
| Full_time | |
| Afternoon | |
| Afternoon_Night | 623 |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 12.189214 |
| Min length | 5 |
Characters and Unicode
| Total characters | 215615 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Morning_Afternoon |
|---|---|
| 2nd row | Morning_Afternoon |
| 3rd row | Morning_Afternoon |
| 4th row | Morning_Afternoon |
| 5th row | Morning_Afternoon |
Common Values
| Value | Count | Frequency (%) |
| Morning_Afternoon | 8663 | |
| Night | 4113 | |
| Full_time | 2658 | 15.0% |
| Afternoon | 1544 | 8.7% |
| Afternoon_Night | 623 | 3.5% |
| Morning | 88 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| morning_afternoon | 8663 | |
| night | 4113 | |
| full_time | 2658 | 15.0% |
| afternoon | 1544 | 8.7% |
| afternoon_night | 623 | 3.5% |
| morning | 88 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 39162 | |
| o | 30411 | |
| r | 19581 | |
| t | 18224 | |
| i | 16145 | |
| e | 13488 | 6.3% |
| g | 13487 | 6.3% |
| _ | 11944 | 5.5% |
| f | 10830 | 5.0% |
| A | 10830 | 5.0% |
| Other values (7) | 31513 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 176696 | |
| Uppercase Letter | 26975 | 12.5% |
| Connector Punctuation | 11944 | 5.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 39162 | |
| o | 30411 | |
| r | 19581 | |
| t | 18224 | |
| i | 16145 | |
| e | 13488 | 7.6% |
| g | 13487 | 7.6% |
| f | 10830 | 6.1% |
| l | 5316 | 3.0% |
| h | 4736 | 2.7% |
| Other values (2) | 5316 | 3.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 10830 | |
| M | 8751 | |
| N | 4736 | |
| F | 2658 | 9.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 11944 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 203671 | |
| Common | 11944 | 5.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 39162 | |
| o | 30411 | |
| r | 19581 | |
| t | 18224 | |
| i | 16145 | |
| e | 13488 | 6.6% |
| g | 13487 | 6.6% |
| f | 10830 | 5.3% |
| A | 10830 | 5.3% |
| M | 8751 | 4.3% |
| Other values (6) | 22762 |
Common
| Value | Count | Frequency (%) |
| _ | 11944 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 215615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 39162 | |
| o | 30411 | |
| r | 19581 | |
| t | 18224 | |
| i | 16145 | |
| e | 13488 | 6.3% |
| g | 13487 | 6.3% |
| _ | 11944 | 5.5% |
| f | 10830 | 5.0% |
| A | 10830 | 5.0% |
| Other values (7) | 31513 |
years_between_high_school_and_college
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 462 |
| Missing (%) | 2.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.005979 |
| Minimum | 0 |
|---|---|
| Maximum | 78 |
| Zeros | 514 |
| Zeros (%) | 2.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 8 |
| 95-th percentile | 22 |
| Maximum | 78 |
| Range | 78 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 7.311533 |
|---|---|
| Coefficient of variation (CV) | 1.2173757 |
| Kurtosis | 5.40749 |
| Mean | 6.005979 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 2.1586312 |
| Sum | 103465 |
| Variance | 53.458515 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 5077 | |
| 2 | 2363 | |
| 3 | 1546 | 8.7% |
| 4 | 1053 | 6.0% |
| 5 | 870 | 4.9% |
| 6 | 717 | 4.1% |
| 7 | 582 | 3.3% |
| 0 | 514 | 2.9% |
| 8 | 473 | 2.7% |
| 9 | 404 | 2.3% |
| Other values (41) | 3628 | |
| (Missing) | 462 | 2.6% |
| Value | Count | Frequency (%) |
| 0 | 514 | 2.9% |
| 1 | 5077 | |
| 2 | 2363 | |
| 3 | 1546 | 8.7% |
| 4 | 1053 | 6.0% |
| 5 | 870 | 4.9% |
| 6 | 717 | 4.1% |
| 7 | 582 | 3.3% |
| 8 | 473 | 2.7% |
| 9 | 404 | 2.3% |
| Value | Count | Frequency (%) |
| 78 | 1 | < 0.1% |
| 49 | 1 | < 0.1% |
| 48 | 2 | < 0.1% |
| 47 | 2 | < 0.1% |
| 46 | 6 | |
| 45 | 3 | < 0.1% |
| 44 | 3 | < 0.1% |
| 43 | 4 | < 0.1% |
| 42 | 10 | |
| 41 | 6 |
fundamental_area
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.3 KiB |
| Philosophy_Human | |
|---|---|
| Exact_Technology | |
| Health_Biological | |
| Literature_Arts | |
| Agricultural |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 15.68144 |
| Min length | 12 |
Characters and Unicode
| Total characters | 277389 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Agricultural |
|---|---|
| 2nd row | Agricultural |
| 3rd row | Agricultural |
| 4th row | Agricultural |
| 5th row | Agricultural |
Common Values
| Value | Count | Frequency (%) |
| Philosophy_Human | 6088 | |
| Exact_Technology | 4263 | |
| Health_Biological | 3031 | |
| Literature_Arts | 2854 | |
| Agricultural | 1453 | 8.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| philosophy_human | 6088 | |
| exact_technology | 4263 | |
| health_biological | 3031 | |
| literature_arts | 2854 | |
| agricultural | 1453 | 8.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 26764 | 9.6% |
| l | 22350 | 8.1% |
| a | 20720 | 7.5% |
| h | 19470 | 7.0% |
| t | 17309 | 6.2% |
| i | 16457 | 5.9% |
| _ | 16236 | 5.9% |
| c | 13010 | 4.7% |
| e | 13002 | 4.7% |
| u | 11848 | 4.3% |
| Other values (15) | 100223 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 227228 | |
| Uppercase Letter | 33925 | 12.2% |
| Connector Punctuation | 16236 | 5.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 26764 | |
| l | 22350 | |
| a | 20720 | 9.1% |
| h | 19470 | 8.6% |
| t | 17309 | 7.6% |
| i | 16457 | 7.2% |
| c | 13010 | 5.7% |
| e | 13002 | 5.7% |
| u | 11848 | 5.2% |
| r | 11468 | 5.0% |
| Other values (7) | 54830 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 9119 | |
| P | 6088 | |
| A | 4307 | |
| T | 4263 | |
| E | 4263 | |
| B | 3031 | 8.9% |
| L | 2854 | 8.4% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 16236 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 261153 | |
| Common | 16236 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 26764 | 10.2% |
| l | 22350 | 8.6% |
| a | 20720 | 7.9% |
| h | 19470 | 7.5% |
| t | 17309 | 6.6% |
| i | 16457 | 6.3% |
| c | 13010 | 5.0% |
| e | 13002 | 5.0% |
| u | 11848 | 4.5% |
| r | 11468 | 4.4% |
| Other values (14) | 88755 |
Common
| Value | Count | Frequency (%) |
| _ | 16236 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 277389 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 26764 | 9.6% |
| l | 22350 | 8.1% |
| a | 20720 | 7.5% |
| h | 19470 | 7.0% |
| t | 17309 | 6.2% |
| i | 16457 | 5.9% |
| _ | 16236 | 5.9% |
| c | 13010 | 4.7% |
| e | 13002 | 4.7% |
| u | 11848 | 4.3% |
| Other values (15) | 100223 |
num_disciplines
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.3100797 |
| Minimum | 1 |
|---|---|
| Maximum | 66 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5 |
| median | 6 |
| Q3 | 7 |
| 95-th percentile | 9 |
| Maximum | 66 |
| Range | 65 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.4449556 |
|---|---|
| Coefficient of variation (CV) | 0.54594486 |
| Kurtosis | 49.092555 |
| Mean | 6.3100797 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 6.1233791 |
| Sum | 111619 |
| Variance | 11.867719 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 6901 | |
| 6 | 5304 | |
| 7 | 2794 | |
| 4 | 756 | 4.3% |
| 8 | 601 | 3.4% |
| 9 | 191 | 1.1% |
| 3 | 161 | 0.9% |
| 10 | 120 | 0.7% |
| 2 | 110 | 0.6% |
| 11 | 90 | 0.5% |
| Other values (40) | 661 | 3.7% |
| Value | Count | Frequency (%) |
| 1 | 26 | 0.1% |
| 2 | 110 | 0.6% |
| 3 | 161 | 0.9% |
| 4 | 756 | 4.3% |
| 5 | 6901 | |
| 6 | 5304 | |
| 7 | 2794 | |
| 8 | 601 | 3.4% |
| 9 | 191 | 1.1% |
| 10 | 120 | 0.7% |
| Value | Count | Frequency (%) |
| 66 | 1 | < 0.1% |
| 54 | 1 | < 0.1% |
| 48 | 1 | < 0.1% |
| 47 | 2 | |
| 46 | 3 | |
| 45 | 1 | < 0.1% |
| 44 | 2 | |
| 43 | 3 | |
| 42 | 3 | |
| 41 | 4 |
num_credits
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 158 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.5357 |
| Minimum | 2 |
|---|---|
| Maximum | 243 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 20 |
| median | 22 |
| Q3 | 26 |
| 95-th percentile | 35 |
| Maximum | 243 |
| Range | 241 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 13.06481 |
|---|---|
| Coefficient of variation (CV) | 0.53248165 |
| Kurtosis | 49.963833 |
| Mean | 24.5357 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 6.0780578 |
| Sum | 434012 |
| Variance | 170.68926 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 5309 | |
| 24 | 1954 | 11.0% |
| 22 | 1847 | 10.4% |
| 26 | 1411 | 8.0% |
| 21 | 961 | 5.4% |
| 18 | 784 | 4.4% |
| 23 | 710 | 4.0% |
| 29 | 692 | 3.9% |
| 31 | 486 | 2.7% |
| 35 | 405 | 2.3% |
| Other values (148) | 3130 |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 3 | 5 | < 0.1% |
| 4 | 21 | 0.1% |
| 6 | 12 | 0.1% |
| 7 | 6 | < 0.1% |
| 8 | 92 | |
| 9 | 12 | 0.1% |
| 10 | 43 | 0.2% |
| 11 | 15 | 0.1% |
| 12 | 204 |
| Value | Count | Frequency (%) |
| 243 | 1 | |
| 220 | 1 | |
| 216 | 1 | |
| 204 | 1 | |
| 179 | 1 | |
| 178 | 1 | |
| 176 | 1 | |
| 170 | 1 | |
| 169 | 2 | |
| 167 | 1 |
semester_average
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 967 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.8709893 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 1220 |
| Zeros (%) | 6.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3.96 |
| median | 7.01 |
| Q3 | 8.23 |
| 95-th percentile | 9.17 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 4.27 |
Descriptive statistics
| Standard deviation | 3.0134864 |
|---|---|
| Coefficient of variation (CV) | 0.51328425 |
| Kurtosis | -0.69701535 |
| Mean | 5.8709893 |
| Median Absolute Deviation (MAD) | 1.57 |
| Skewness | -0.82005056 |
| Sum | 103851.93 |
| Variance | 9.0811001 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1220 | 6.9% |
| 0.12 | 143 | 0.8% |
| 8.1 | 96 | 0.5% |
| 8 | 94 | 0.5% |
| 8.2 | 93 | 0.5% |
| 8.4 | 92 | 0.5% |
| 8.6 | 92 | 0.5% |
| 8.5 | 89 | 0.5% |
| 8.3 | 88 | 0.5% |
| 7.7 | 81 | 0.5% |
| Other values (957) | 15601 |
| Value | Count | Frequency (%) |
| 0 | 1220 | |
| 0.02 | 13 | 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.04 | 10 | 0.1% |
| 0.05 | 12 | 0.1% |
| 0.06 | 13 | 0.1% |
| 0.07 | 11 | 0.1% |
| 0.08 | 13 | 0.1% |
| 0.1 | 20 | 0.1% |
| 0.11 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 12 | |
| 9.94 | 2 | < 0.1% |
| 9.93 | 1 | < 0.1% |
| 9.9 | 5 | |
| 9.88 | 1 | < 0.1% |
| 9.87 | 1 | < 0.1% |
| 9.85 | 4 | < 0.1% |
| 9.84 | 1 | < 0.1% |
| 9.83 | 2 | < 0.1% |
| 9.82 | 5 |
num_approved_credits
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 41 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.680536 |
| Minimum | 0 |
|---|---|
| Maximum | 60 |
| Zeros | 3165 |
| Zeros (%) | 17.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 6 |
| median | 18 |
| Q3 | 21 |
| 95-th percentile | 29 |
| Maximum | 60 |
| Range | 60 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 9.3807463 |
|---|---|
| Coefficient of variation (CV) | 0.63899209 |
| Kurtosis | -0.9604064 |
| Mean | 14.680536 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.28079946 |
| Sum | 259684 |
| Variance | 87.998401 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3165 | |
| 20 | 3085 | |
| 16 | 1128 | 6.4% |
| 22 | 1107 | 6.3% |
| 24 | 970 | 5.5% |
| 12 | 826 | 4.7% |
| 18 | 700 | 4.0% |
| 8 | 668 | 3.8% |
| 4 | 647 | 3.7% |
| 14 | 552 | 3.1% |
| Other values (31) | 4841 |
| Value | Count | Frequency (%) |
| 0 | 3165 | |
| 1 | 15 | 0.1% |
| 2 | 238 | 1.3% |
| 3 | 82 | 0.5% |
| 4 | 647 | 3.7% |
| 5 | 52 | 0.3% |
| 6 | 275 | 1.6% |
| 7 | 72 | 0.4% |
| 8 | 668 | 3.8% |
| 9 | 118 | 0.7% |
| Value | Count | Frequency (%) |
| 60 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 36 | 3 | < 0.1% |
| 35 | 175 | |
| 34 | 1 | < 0.1% |
| 33 | 4 | < 0.1% |
| 32 | 11 | 0.1% |
| 31 | 333 |
num_dispensed_credits
Real number (ℝ)
ZEROS 
| Distinct | 145 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9328962 |
| Minimum | 0 |
|---|---|
| Maximum | 231 |
| Zeros | 16089 |
| Zeros (%) | 91.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 16 |
| Maximum | 231 |
| Range | 231 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 13.681898 |
|---|---|
| Coefficient of variation (CV) | 4.6649787 |
| Kurtosis | 54.258435 |
| Mean | 2.9328962 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.6923553 |
| Sum | 51880 |
| Variance | 187.19433 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16089 | |
| 4 | 178 | 1.0% |
| 8 | 122 | 0.7% |
| 12 | 109 | 0.6% |
| 16 | 57 | 0.3% |
| 24 | 49 | 0.3% |
| 10 | 42 | 0.2% |
| 20 | 42 | 0.2% |
| 6 | 41 | 0.2% |
| 14 | 39 | 0.2% |
| Other values (135) | 921 | 5.2% |
| Value | Count | Frequency (%) |
| 0 | 16089 | |
| 2 | 33 | 0.2% |
| 3 | 15 | 0.1% |
| 4 | 178 | 1.0% |
| 5 | 11 | 0.1% |
| 6 | 41 | 0.2% |
| 7 | 18 | 0.1% |
| 8 | 122 | 0.7% |
| 9 | 11 | 0.1% |
| 10 | 42 | 0.2% |
| Value | Count | Frequency (%) |
| 231 | 1 | |
| 204 | 1 | |
| 202 | 1 | |
| 200 | 1 | |
| 162 | 1 | |
| 159 | 1 | |
| 157 | 1 | |
| 156 | 1 | |
| 155 | 1 | |
| 152 | 1 |
num_failed_credits
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.8878964 |
| Minimum | 0 |
|---|---|
| Maximum | 40 |
| Zeros | 10167 |
| Zeros (%) | 57.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 6 |
| 95-th percentile | 20 |
| Maximum | 40 |
| Range | 40 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.9485236 |
|---|---|
| Coefficient of variation (CV) | 1.5300108 |
| Kurtosis | 1.8752018 |
| Mean | 3.8878964 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.632099 |
| Sum | 68773 |
| Variance | 35.384933 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 10167 | |
| 4 | 2329 | 13.2% |
| 8 | 1056 | 6.0% |
| 20 | 714 | 4.0% |
| 12 | 535 | 3.0% |
| 10 | 439 | 2.5% |
| 6 | 420 | 2.4% |
| 16 | 336 | 1.9% |
| 2 | 267 | 1.5% |
| 14 | 248 | 1.4% |
| Other values (23) | 1178 | 6.7% |
| Value | Count | Frequency (%) |
| 0 | 10167 | |
| 2 | 267 | 1.5% |
| 3 | 199 | 1.1% |
| 4 | 2329 | 13.2% |
| 5 | 159 | 0.9% |
| 6 | 420 | 2.4% |
| 7 | 139 | 0.8% |
| 8 | 1056 | 6.0% |
| 9 | 68 | 0.4% |
| 10 | 439 | 2.5% |
| Value | Count | Frequency (%) |
| 40 | 2 | < 0.1% |
| 36 | 1 | < 0.1% |
| 32 | 4 | < 0.1% |
| 30 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 28 | 3 | < 0.1% |
| 27 | 6 | < 0.1% |
| 26 | 8 | < 0.1% |
| 25 | 5 | < 0.1% |
| 24 | 35 |
num_nonattendance_credits
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6261518 |
| Minimum | 0 |
|---|---|
| Maximum | 35 |
| Zeros | 13352 |
| Zeros (%) | 75.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 17 |
| Maximum | 35 |
| Range | 35 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5.7505567 |
|---|---|
| Coefficient of variation (CV) | 2.1897274 |
| Kurtosis | 5.6960311 |
| Mean | 2.6261518 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.4629794 |
| Sum | 46454 |
| Variance | 33.068902 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 13352 | |
| 4 | 1101 | 6.2% |
| 8 | 481 | 2.7% |
| 12 | 333 | 1.9% |
| 16 | 292 | 1.7% |
| 10 | 241 | 1.4% |
| 6 | 238 | 1.3% |
| 18 | 225 | 1.3% |
| 20 | 211 | 1.2% |
| 14 | 211 | 1.2% |
| Other values (23) | 1004 | 5.7% |
| Value | Count | Frequency (%) |
| 0 | 13352 | |
| 1 | 1 | < 0.1% |
| 2 | 166 | 0.9% |
| 3 | 79 | 0.4% |
| 4 | 1101 | 6.2% |
| 5 | 75 | 0.4% |
| 6 | 238 | 1.3% |
| 7 | 50 | 0.3% |
| 8 | 481 | 2.7% |
| 9 | 35 | 0.2% |
| Value | Count | Frequency (%) |
| 35 | 16 | 0.1% |
| 31 | 21 | 0.1% |
| 30 | 20 | 0.1% |
| 29 | 17 | 0.1% |
| 28 | 4 | < 0.1% |
| 27 | 2 | < 0.1% |
| 26 | 61 | |
| 25 | 31 | |
| 24 | 38 | |
| 23 | 29 |
num_locked_credits
Real number (ℝ)
ZEROS 
| Distinct | 25 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4082198 |
| Minimum | 0 |
|---|---|
| Maximum | 29 |
| Zeros | 16767 |
| Zeros (%) | 94.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 29 |
| Range | 29 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.0591277 |
|---|---|
| Coefficient of variation (CV) | 5.0441643 |
| Kurtosis | 44.342248 |
| Mean | 0.4082198 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.2475523 |
| Sum | 7221 |
| Variance | 4.2400071 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16767 | |
| 4 | 307 | 1.7% |
| 8 | 122 | 0.7% |
| 12 | 77 | 0.4% |
| 6 | 74 | 0.4% |
| 10 | 57 | 0.3% |
| 2 | 50 | 0.3% |
| 14 | 39 | 0.2% |
| 3 | 29 | 0.2% |
| 11 | 26 | 0.1% |
| Other values (15) | 141 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 16767 | |
| 2 | 50 | 0.3% |
| 3 | 29 | 0.2% |
| 4 | 307 | 1.7% |
| 5 | 11 | 0.1% |
| 6 | 74 | 0.4% |
| 7 | 20 | 0.1% |
| 8 | 122 | 0.7% |
| 9 | 10 | 0.1% |
| 10 | 57 | 0.3% |
| Value | Count | Frequency (%) |
| 29 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 3 | < 0.1% |
| 22 | 4 | < 0.1% |
| 21 | 5 | < 0.1% |
| 20 | 15 | |
| 19 | 10 | |
| 18 | 19 | |
| 17 | 1 | < 0.1% |
| 16 | 22 |
num_credits_reference_semester
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.736955 |
| Minimum | 12 |
|---|---|
| Maximum | 40 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 12 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 20 |
| median | 22 |
| Q3 | 26 |
| 95-th percentile | 35 |
| Maximum | 40 |
| Range | 28 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.2316264 |
|---|---|
| Coefficient of variation (CV) | 0.22040006 |
| Kurtosis | 0.80521125 |
| Mean | 23.736955 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.0476569 |
| Sum | 419883 |
| Variance | 27.369915 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 5619 | |
| 24 | 2229 | 12.6% |
| 22 | 1720 | 9.7% |
| 26 | 1289 | 7.3% |
| 21 | 877 | 5.0% |
| 29 | 774 | 4.4% |
| 18 | 562 | 3.2% |
| 28 | 537 | 3.0% |
| 36 | 517 | 2.9% |
| 23 | 506 | 2.9% |
| Other values (10) | 3059 |
| Value | Count | Frequency (%) |
| 12 | 103 | 0.6% |
| 14 | 160 | 0.9% |
| 16 | 387 | 2.2% |
| 18 | 562 | 3.2% |
| 20 | 5619 | |
| 21 | 877 | 5.0% |
| 22 | 1720 | 9.7% |
| 23 | 506 | 2.9% |
| 24 | 2229 | 12.6% |
| 25 | 498 | 2.8% |
| Value | Count | Frequency (%) |
| 40 | 246 | 1.4% |
| 36 | 517 | |
| 35 | 454 | 2.6% |
| 34 | 472 | 2.7% |
| 31 | 418 | 2.4% |
| 30 | 289 | 1.6% |
| 29 | 774 | |
| 28 | 537 | |
| 27 | 32 | 0.2% |
| 26 | 1289 |
num_exams
Real number (ℝ)
ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0134547 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 8423 |
| Zeros (%) | 47.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 138.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.2278521 |
|---|---|
| Coefficient of variation (CV) | 1.2115511 |
| Kurtosis | 0.60308495 |
| Mean | 1.0134547 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.138134 |
| Sum | 17927 |
| Variance | 1.5076209 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8423 | |
| 1 | 4097 | |
| 2 | 2742 | 15.5% |
| 3 | 1578 | 8.9% |
| 4 | 653 | 3.7% |
| 5 | 176 | 1.0% |
| 6 | 20 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 8423 | |
| 1 | 4097 | |
| 2 | 2742 | 15.5% |
| 3 | 1578 | 8.9% |
| 4 | 653 | 3.7% |
| 5 | 176 | 1.0% |
| 6 | 20 | 0.1% |
| Value | Count | Frequency (%) |
| 6 | 20 | 0.1% |
| 5 | 176 | 1.0% |
| 4 | 653 | 3.7% |
| 3 | 1578 | 8.9% |
| 2 | 2742 | 15.5% |
| 1 | 4097 | |
| 0 | 8423 |
status
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.3 KiB |
| Dropout | |
|---|---|
| Graduated |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.7942789 |
| Min length | 7 |
Characters and Unicode
| Total characters | 137873 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Graduated |
|---|---|
| 2nd row | Graduated |
| 3rd row | Graduated |
| 4th row | Graduated |
| 5th row | Graduated |
Common Values
| Value | Count | Frequency (%) |
| Dropout | 10664 | |
| Graduated | 7025 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| dropout | 10664 | |
| graduated | 7025 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 21328 | |
| r | 17689 | |
| u | 17689 | |
| t | 17689 | |
| a | 14050 | |
| d | 14050 | |
| D | 10664 | |
| p | 10664 | |
| G | 7025 | 5.1% |
| e | 7025 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 120184 | |
| Uppercase Letter | 17689 | 12.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 21328 | |
| r | 17689 | |
| u | 17689 | |
| t | 17689 | |
| a | 14050 | |
| d | 14050 | |
| p | 10664 | |
| e | 7025 | 5.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 10664 | |
| G | 7025 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 137873 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 21328 | |
| r | 17689 | |
| u | 17689 | |
| t | 17689 | |
| a | 14050 | |
| d | 14050 | |
| D | 10664 | |
| p | 10664 | |
| G | 7025 | 5.1% |
| e | 7025 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 137873 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 21328 | |
| r | 17689 | |
| u | 17689 | |
| t | 17689 | |
| a | 14050 | |
| d | 14050 | |
| D | 10664 | |
| p | 10664 | |
| G | 7025 | 5.1% |
| e | 7025 | 5.1% |
| entry_age | from_public_school | fundamental_area | gender | has_college_degree | marital_status | num_approved_credits | num_credits | num_credits_reference_semester | num_disciplines | num_dispensed_credits | num_exams | num_failed_credits | num_locked_credits | num_nonattendance_credits | quota | race | semester_average | shift | status | years_between_high_school_and_college | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| entry_age | 1.000 | 0.050 | 0.196 | 0.053 | 0.120 | 0.303 | -0.243 | -0.156 | -0.177 | -0.137 | 0.155 | -0.081 | 0.076 | 0.034 | 0.045 | 0.104 | 0.049 | -0.131 | 0.243 | 0.163 | 0.839 |
| from_public_school | 0.050 | 1.000 | 0.048 | 0.058 | 0.000 | 0.022 | -0.051 | -0.055 | -0.017 | -0.045 | -0.044 | 0.041 | 0.049 | 0.019 | 0.017 | 0.517 | 0.173 | -0.048 | 0.075 | 0.016 | 0.012 |
| fundamental_area | 0.196 | 0.048 | 1.000 | 0.138 | 0.053 | 0.130 | -0.092 | -0.358 | -0.426 | -0.254 | -0.030 | -0.155 | -0.064 | -0.032 | -0.080 | 0.063 | 0.033 | 0.080 | 0.420 | 0.251 | 0.299 |
| gender | 0.053 | 0.058 | 0.138 | 1.000 | 0.000 | 0.050 | -0.063 | -0.058 | -0.083 | 0.028 | -0.000 | 0.007 | 0.018 | -0.021 | 0.061 | 0.049 | 0.047 | -0.042 | 0.114 | 0.088 | -0.015 |
| has_college_degree | 0.120 | 0.000 | 0.053 | 0.000 | 1.000 | 0.071 | -0.045 | 0.012 | -0.033 | 0.012 | 0.063 | -0.033 | 0.009 | 0.003 | 0.027 | 0.025 | 0.037 | -0.027 | 0.074 | 0.051 | 0.094 |
| marital_status | 0.303 | 0.022 | 0.130 | 0.050 | 0.071 | 1.000 | 0.094 | 0.124 | 0.112 | 0.128 | -0.009 | 0.031 | -0.038 | 0.007 | 0.034 | 0.065 | 0.009 | 0.039 | 0.169 | 0.074 | -0.409 |
| num_approved_credits | -0.243 | -0.051 | -0.092 | -0.063 | -0.045 | 0.094 | 1.000 | 0.324 | 0.324 | 0.215 | -0.117 | 0.099 | -0.616 | -0.160 | -0.600 | 0.045 | 0.028 | 0.747 | 0.155 | 0.525 | -0.169 |
| num_credits | -0.156 | -0.055 | -0.358 | -0.058 | 0.012 | 0.124 | 0.324 | 1.000 | 0.657 | 0.689 | 0.371 | 0.104 | -0.019 | 0.080 | 0.010 | 0.025 | 0.038 | 0.048 | 0.123 | 0.111 | -0.095 |
| num_credits_reference_semester | -0.177 | -0.017 | -0.426 | -0.083 | -0.033 | 0.112 | 0.324 | 0.657 | 1.000 | 0.423 | 0.041 | 0.093 | -0.037 | 0.033 | -0.002 | 0.058 | 0.021 | 0.054 | 0.386 | 0.214 | -0.128 |
| num_disciplines | -0.137 | -0.045 | -0.254 | 0.028 | 0.012 | 0.128 | 0.215 | 0.689 | 0.423 | 1.000 | 0.387 | 0.032 | -0.086 | 0.102 | 0.026 | 0.022 | 0.040 | 0.138 | 0.072 | 0.058 | -0.077 |
| num_dispensed_credits | 0.155 | -0.044 | -0.030 | -0.000 | 0.063 | -0.009 | -0.117 | 0.371 | 0.041 | 0.387 | 1.000 | -0.105 | -0.109 | 0.036 | -0.012 | 0.033 | 0.045 | 0.067 | 0.043 | 0.040 | 0.201 |
| num_exams | -0.081 | 0.041 | -0.155 | 0.007 | -0.033 | 0.031 | 0.099 | 0.104 | 0.093 | 0.032 | -0.105 | 1.000 | 0.302 | -0.074 | -0.195 | 0.049 | 0.040 | -0.209 | 0.079 | 0.021 | -0.088 |
| num_failed_credits | 0.076 | 0.049 | -0.064 | 0.018 | 0.009 | -0.038 | -0.616 | -0.019 | -0.037 | -0.086 | -0.109 | 0.302 | 1.000 | -0.015 | 0.218 | 0.057 | 0.026 | -0.700 | 0.192 | 0.352 | 0.021 |
| num_locked_credits | 0.034 | 0.019 | -0.032 | -0.021 | 0.003 | 0.007 | -0.160 | 0.080 | 0.033 | 0.102 | 0.036 | -0.074 | -0.015 | 1.000 | 0.103 | 0.006 | 0.020 | -0.065 | 0.056 | 0.104 | 0.028 |
| num_nonattendance_credits | 0.045 | 0.017 | -0.080 | 0.061 | 0.027 | 0.034 | -0.600 | 0.010 | -0.002 | 0.026 | -0.012 | -0.195 | 0.218 | 0.103 | 1.000 | 0.015 | 0.024 | -0.632 | 0.107 | 0.369 | 0.005 |
| quota | 0.104 | 0.517 | 0.063 | 0.049 | 0.025 | 0.065 | 0.045 | 0.025 | 0.058 | 0.022 | 0.033 | 0.049 | 0.057 | 0.006 | 0.015 | 1.000 | 0.384 | -0.018 | 0.131 | 0.079 | 0.187 |
| race | 0.049 | 0.173 | 0.033 | 0.047 | 0.037 | 0.009 | 0.028 | 0.038 | 0.021 | 0.040 | 0.045 | 0.040 | 0.026 | 0.020 | 0.024 | 0.384 | 1.000 | 0.052 | 0.050 | 0.041 | -0.019 |
| semester_average | -0.131 | -0.048 | 0.080 | -0.042 | -0.027 | 0.039 | 0.747 | 0.048 | 0.054 | 0.138 | 0.067 | -0.209 | -0.700 | -0.065 | -0.632 | -0.018 | 0.052 | 1.000 | 0.117 | 0.516 | -0.059 |
| shift | 0.243 | 0.075 | 0.420 | 0.114 | 0.074 | 0.169 | 0.155 | 0.123 | 0.386 | 0.072 | 0.043 | 0.079 | 0.192 | 0.056 | 0.107 | 0.131 | 0.050 | 0.117 | 1.000 | 0.137 | -0.131 |
| status | 0.163 | 0.016 | 0.251 | 0.088 | 0.051 | 0.074 | 0.525 | 0.111 | 0.214 | 0.058 | 0.040 | 0.021 | 0.352 | 0.104 | 0.369 | 0.079 | 0.041 | 0.516 | 0.137 | 1.000 | -0.093 |
| years_between_high_school_and_college | 0.839 | 0.012 | 0.299 | -0.015 | 0.094 | -0.409 | -0.169 | -0.095 | -0.128 | -0.077 | 0.201 | -0.088 | 0.021 | 0.028 | 0.005 | 0.187 | -0.019 | -0.059 | -0.131 | -0.093 | 1.000 |
| entry_age | gender | race | marital_status | quota | from_public_school | has_college_degree | shift | years_between_high_school_and_college | fundamental_area | num_disciplines | num_credits | semester_average | num_approved_credits | num_dispensed_credits | num_failed_credits | num_nonattendance_credits | num_locked_credits | num_credits_reference_semester | num_exams | status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 29.0 | Male | White | Single | OC | No | No | Morning_Afternoon | 12.0 | Agricultural | 7 | 24 | 7.47 | 24 | 0 | 0 | 0 | 0 | 28 | 2 | Graduated |
| 1 | 17.0 | Male | White | Single | OC | No | No | Morning_Afternoon | 1.0 | Agricultural | 7 | 24 | 8.26 | 24 | 0 | 0 | 0 | 0 | 24 | 0 | Graduated |
| 2 | 21.0 | Female | White | Single | OC | No | No | Morning_Afternoon | 5.0 | Agricultural | 6 | 22 | 8.18 | 22 | 0 | 0 | 0 | 0 | 24 | 0 | Graduated |
| 3 | 21.0 | Male | White | Single | OC | No | No | Morning_Afternoon | 4.0 | Agricultural | 7 | 24 | 8.14 | 24 | 0 | 0 | 0 | 0 | 24 | 1 | Graduated |
| 4 | 17.0 | Male | White | Single | OC | No | No | Morning_Afternoon | 1.0 | Agricultural | 7 | 24 | 8.74 | 24 | 0 | 0 | 0 | 0 | 24 | 0 | Graduated |
| 5 | 21.0 | Male | White | Single | OC | Yes | No | Morning_Afternoon | 3.0 | Agricultural | 6 | 22 | 6.48 | 20 | 0 | 2 | 0 | 0 | 24 | 1 | Dropout |
| 6 | 20.0 | Female | Prefer_not_to_declare | Single | OC | Yes | No | Morning_Afternoon | 5.0 | Agricultural | 7 | 24 | 7.99 | 24 | 0 | 0 | 0 | 0 | 24 | 0 | Graduated |
| 7 | 19.0 | Female | White | Single | OC | Yes | No | Morning_Afternoon | 1.0 | Agricultural | 6 | 22 | 8.72 | 22 | 0 | 0 | 0 | 0 | 24 | 0 | Graduated |
| 8 | 20.0 | Male | Mixed_race | Single | L06 | Yes | No | Morning_Afternoon | 3.0 | Agricultural | 7 | 24 | 2.46 | 4 | 8 | 12 | 0 | 0 | 24 | 1 | Dropout |
| 9 | 18.0 | Female | Black | Single | L06 | Yes | No | Morning_Afternoon | 2.0 | Agricultural | 7 | 24 | 0.00 | 0 | 0 | 2 | 14 | 8 | 24 | 0 | Dropout |
| entry_age | gender | race | marital_status | quota | from_public_school | has_college_degree | shift | years_between_high_school_and_college | fundamental_area | num_disciplines | num_credits | semester_average | num_approved_credits | num_dispensed_credits | num_failed_credits | num_nonattendance_credits | num_locked_credits | num_credits_reference_semester | num_exams | status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17679 | 22.0 | Male | White | Single | NaN | Yes | No | Full_time | 3.0 | Literature_Arts | 2 | 7 | 1.00 | 0 | 0 | 7 | 0 | 0 | 23 | 0 | Dropout |
| 17680 | 19.0 | Male | White | Single | NaN | Yes | No | Full_time | 2.0 | Literature_Arts | 6 | 23 | 0.15 | 0 | 0 | 23 | 0 | 0 | 23 | 0 | Dropout |
| 17681 | 20.0 | Female | White | Single | NaN | Yes | No | Full_time | 1.0 | Literature_Arts | 5 | 19 | 6.64 | 19 | 0 | 0 | 0 | 0 | 23 | 0 | Dropout |
| 17682 | 51.0 | Female | White | Married | NaN | Yes | No | Full_time | 18.0 | Literature_Arts | 6 | 23 | 0.00 | 0 | 0 | 23 | 0 | 0 | 23 | 0 | Dropout |
| 17683 | 19.0 | Male | White | Single | NaN | NaN | No | Full_time | NaN | Philosophy_Human | 1 | 4 | 7.00 | 4 | 0 | 0 | 0 | 0 | 20 | 0 | Dropout |
| 17684 | 20.0 | Female | White | Single | L01 | Yes | No | Morning_Afternoon | 2.0 | Health_Biological | 6 | 29 | 8.93 | 29 | 0 | 0 | 0 | 0 | 29 | 0 | Graduated |
| 17685 | 19.0 | Male | White | Single | OC | Yes | No | Morning_Afternoon | 2.0 | Health_Biological | 6 | 29 | 8.80 | 29 | 0 | 0 | 0 | 0 | 29 | 0 | Graduated |
| 17686 | 33.0 | Female | White | Single | NaN | No | No | Full_time | 14.0 | Philosophy_Human | 5 | 20 | 2.44 | 4 | 0 | 16 | 0 | 0 | 20 | 2 | Dropout |
| 17687 | 25.0 | Male | Black | Single | NaN | Yes | No | Morning_Afternoon | 2.0 | Health_Biological | 5 | 21 | 3.62 | 9 | 0 | 12 | 0 | 0 | 21 | 4 | Dropout |
| 17688 | 18.0 | Female | Mixed_race | Single | NaN | Yes | No | Night | 1.0 | Philosophy_Human | 7 | 24 | 7.41 | 22 | 0 | 2 | 0 | 0 | 28 | 1 | Graduated |